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ABSTRACT 

We study the luminosity gap, Ami2, between the first and second ranked galaxies in a 
sample of 59 massive (^ 10^^ Mq) galaxy clusters, using data from the Hale Tele- 
scope, the Hubble Space Telescope (HST), Chandra, and Spitzer. We find that A7tt,i2 dis- 
tribution, p(Ar7ii2), is a declining function of Ar7?,i2, to which we fitted a straight line: 
p{Ami2) oc —(0.13 ± 0.02)A7TT,i2. The fraction of clusters with "large" luminosity gaps 
is p(A77ii2 > 1) = 0.37 ± 0.08, which represents a 2>cf excess over that obtained from 
Monte Carlo simulations of a Schechter function that matches the mean cluster galaxy lumi- 
nosity function. We also identify four clusters with "extreme" luminosity gaps, Ar7ii2 > 2, 
giving a fraction of p(Ami2 > 2) = 0.07too3- More generally, large luminosity gap clus- 
ters are relatively homogeneous, with elliptical/disky brightest cluster galaxies (BCGs), cuspy 
gas density profiles (i.e. strong cool cores), high concentrations, and low substructure frac- 
tions. In contrast, small luminosity gap clusters are heterogeneous, spanning the full range of 
boxy/elliptical/disky BCG morphologies, the full range of cool core strengths and dark mat- 
ter concentrations, and have large substructure fractions. Taken together, these results imply 
that the amplitude of the luminosity gap is a function of both the formation epoch, and the 
recent infall history of the cluster "BCG dominance" is therefore a phase that a cluster may 
evolve through, and is not an evolutionary "cul-de-sac". We also compare our results with 
semi-analytic model predictions based on the Millennium Simulation. None of the models 
are able to reproduce all of the observational results on Ami2, underlining the inability of 
the current generation of models to match the empirical properties of BCGs. We identify the 
strength of AGN feedback and the efficiency with which cluster galaxies are replenished after 
they merge with the BCG in each model as possible causes of these discrepancies. 

Key words: galaxies:clusters:general - galaxies:elliptical - galaxies:halos - X-ray:galaxies 
- X-rays:galaxies:clusters - gravitational lensing 



1 INTRODUCTION 

Numerical simulations and large-scale redshift surveys both indi- 
cate that we live in a hierarchical universe, i.e. one in which the 
large-scale structure of the universe grows from the bottom up with 
smaller objects forming earlier than larger objects. This picture 
rests on the matter content of the universe being dominated by col- 
lisionless dark matter particles, smoothly distributed at early times, 
and seeded with small density perturbations. Exploring this picture 



observationally in the non-linear regime of gravitational collapse, 
i.e. within collapsed dark matter halos that host individual galaxies 
through to massive clusters of galaxies, complements the statisti- 
cal analysis of the linear regime probed by galaxy redshift surveys. 
Indeed, a promising route to fleshing out our understanding of hi- 
erarchical structure formation is to measure observable quantities 
that are sensitive to the age and/or assembly history of dark mat- 
ter halos, and thus in principle to test the hierarchical paradigm 
by comparing the observed and predicted distributions. Any dis- 
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crepancies found between observation and theory may ultimately 
point to modifications to the theoretical model including, for exam- 
ple, the properties of the dark m atter particle a nd the distribution 
of initial density fluctuations (e.g. lKomatsu et al . 2009). Quantities 
discussed in the literature that may be useful probes of the age and 
assembly history of dark matter halos include the luminosity gap 
between the first and second ranked galaxies in a group or clus- 
ter (often expressed as th e difference be tween their magnitudes, 
Ami2 = mi — 771,2; e. g. iDariushet alj | 2007|). the concentr ation 
of dark matter halos (e.g. lNeto et alj|2007l : l0kabe et al.ll2010l). and 
the sub-halo po pulation of dark matter halos (e.g. iTavlor & Babull 
l2004IZentnere t al. 2005). 

The luminosities of the first and second ranked 
galaxies in clusters was fir st studied, as far as we are 
aware , by Sandage & Hardvl ( 119731 see also Gelleretal.1 



19761; iT remaine et al.' 1 19771 : lOstriker & Hausmad 1 19771 : 



Oergerle & Hoessel 1989J). More recently the luminosity gap. 



Ami2, was studied in the context of galaxy groups, the term "fos- 
sil" being coin e d to d e scribe virial i zed sy st ems w ith A777i2 > 2 
jPonman et all 1 1994 Ijones et alj l2000l l2003h . L* galaxies 
are absent from fossil groups, which were thus interpreted as 
having formed at early times, with dynamical friction then 
having sufficient time to cause the L* galaxy population to 
merge and form the brightest group galaxy (BGG). Fossil 
groups are expected to be more common than fossil clusters, at 
least in part because the probability of galaxy-galaxy merging 
is anti-correlated with galaxy velocity, and thus with cluster 
mass. Nevertheless, two clusters with masses of ^ ^^^^_ -^Q 
have been found with A777i2 > 2 ( Khosrosha hi et alj 120061 : 
iMendes de Oliveira et ai]|2006l : ICvpriano et al.. 200^. These two 
objects (RXJ1416.4+2315, and RXJ15552.2+2013) raise the 
interesting question of whether the most massive (^10^^ M©) 
clusters might host similarly dominant BCGs. Theoretical studies 
suggest th at this is the case, for example, iMilosavlievic et al.l 
(2006) and lDariushetal.l (2007) predict that ~ 1 - 3% and ~ 5% 
of 10^^ Mq clusters have A7/112 > 2 respectively. 

The concentration parameter of a dark matter halo describes 
the shape of its de nsity profile followi ng the so-called universal 
profile proposed by Navarro et alj ( 119970 and variants thereon. Ha- 
los with smaller concentrations have a flatter density profile, while 
larger concentrations imply a steeper density profile. iBullock et al.l 
( 1200 W analyzed numerical simulations of CDM universes finding 
a weak dependence of concentration on halo mass: c oc M" with 
Q ~ -0.1 (see also Dola g et alj2004l : lNeto et alj2007l : lDuffv etal] 
120081) . This relationship arises from the relative timing of the for- 
mation of dark matter halos as a function of mass. On average less 
massive halos form at earlier times than more massive halos in a 
hierarchical universe. At earlier times the universe was denser than 
at later times, and thus the central regions of less massive halos are 
relatively dense, leading to higher concentration parameters than 
for more massive halos. Some observational studies have reported 
very high concentration parameters in individual systems, for ex- 
ample, one of the fossil groups studied by Khosroshahi et al. ( 2004 , 
I2OO61 was found to have c > 50, based on modeling of X-ray 
observations. Lensing studies of several individual strong-lensing 
clusters have also obtained very high concentra tions of c ^ 10 in 
cont rast to the theoretical prediction of c ~ 5 ( Kneib et al. 2003: 
iGava zzi 2003: Broadhurst et al. 2005: Limousin et al. 2007). More 
recently, observational studies have begun to study larger samples 
and thus to constrain the concentration-mass relation itself, and to 
probe the general populati on rather than a small number of pot en- 
tially extreme objects (e.g. lBuote et al.ll2007l : IOkabe et alj2010l) . 



Theoretically, substructures within dark matter halos, i.e. the 
sub-halo population, are also sensitive to the assembly history of 
the host dark matter halo (e.g. lTavlor & Babulll2004l : IZentner et al.l 
20053. Observationally substructures in galaxy clusters can be iden- 
tified v ia detailed modeling of the observed gravitatio nal lensing 
signal dSmith et alj|2005l l2009l : iRichard et alj|2010j|bl) . Specifi- 
cally, group- and galaxy-scale perturbers are required to achieve 
statistically acceptable fits to the strong-lensing data. The contri- 
bution of these structures to the total cluster mass is quantified via 
the "substructure fraction", /sub, defined as the amount of mass 
within the adopted cluster-centric radius that is assigned to sub- 
struc tures divided by the to tal cluster mass within t he sam e aper- 
ture. Smith & Taviorl ( l2008h combined Ismith et alj 's ^QQ^) obser- 
vational measurements of /sub fo r 10 X-ray luminous galaxy clus- 
ters with lTavlor & BabulT s <2004h semi-analytic model of structure 
formation to explore the interpretation of lensing-based measure- 
ments of /sub . The main conclusion was that /sub depends on both 
when the cluster formed, and on the level of recent mass assem- 
bly, each defined as the lookback time to when each cluster had 
acquired 50% and 90% of their observed mass respectively. For ex- 
ample: clusters at z = 0.2 with /sub < 0.1 formed on average 
at 2 > 0.8, and suffered < 10% mass growth since z — 0.4: in 
contrast, clusters at 2: — 0.2 with the highest substructure frac- 
tions (/sub > 0.4) formed on average since z ~ 0.4 and acquired 
^ 10% of their mass between z — 0.25 and z = 0.2, i.e. a time 
interval of just 500Myr. 

A complementary view of hierarchical merging within galaxy 
clusters is available from BCG morphology. Based on their isopho- 
tal shapes , elliptical ga laxies have been classified as disky or 
boxy (Ben der et al.l 19891) - with positive and negative fourth-order 
Fourier coefficients respectively. The interpretation of boxy and 
disky isophotes in terms of the details of galaxy merger histories 
is a controversial subject (IFaber et al.ll 19971 : iNaab & Burkertll2003l : 



iKhochfar & Burkertl2005h . In this study we will side-step these dif- 
ficulties, and concentrate simply on disky/boxy isophotes as an in- 
dicator of the presence of gas that has dissipated, and settled into a 
disk-like structure, either because the last massive galaxy to merge 
with the BCG was gas rich, or because gas has been accreted by 
the merger product from its environment, e.g. by a BCG in a cool 
core cluster. BCGs are the most massive early-type galaxies and 
are generally expected to have boxy isophotes consist ent with for- 
mation via mergers of early-type (gas - poor) galaxies ( iLin & Mohn 
|2004 iKhosroshahi. Ponman & JonesI l2006h . However, BGGs in 
some fossil groups a re as bright as B CGs and do not have boxy 
isophotes ( Khosroshahi. Ponman & Jones 2006) , suggesting that 
(i) fossil BGGs may form early from the mergers between gas-rich 
spiral galaxies, and (ii) some fossil BGGs may have subsequently 
evolved into BCGs. 

The main aim of this article is to combine measurements of 
the luminosity gap, cool core strength, concentration, substructure 
fraction, and BCG isophotal shape for a large sample of clusters to 
assemble an empirical picture of the hierarchical assembly of clus- 
ters and their BCGs. We present the first observational measure- 
ment of the distribution of the luminosity gap statistic of 10^^ M© 
clusters, and compare this distribution with the other probes of hi- 
erarchical assembly discussed above. This allows us to build an 
empirical picture with which to assess the usefulness of the respec- 
tive measurements for assessing the age of clusters. We also inves- 
tigate how well the current generations of galaxy formation models 
can reproduce the observed luminosity gap distribution. The data 
used for this study are drawn from the Local Cluster Substructure 
Survey (LoCuSS: PI: Smith: http://www.sr.bham.ac.uk/locuss). A 
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summary of LoCuSS is provided in ij2l togetlier with a description 
of the data used in this paper. The analysis and results are then pre- 
sented in ij3] and compared with theoretical predictions in SJD The 
main conclusions are summarized and discussed in ij5] We assume 
ii"o=70 km s"^ Mpc~\ r^iif =0.3 and f^A^O. 7 throughout. In this 
cosmology 1" corresponds to a physical scale of 3.3kpc, at z—0.2. 
All photometric measurements are relative to Vega. 



2 DATA 

2.1 Sample Selection and Observing Strategy 

LoCuSS is a morphologically-unbiased multi-wavelength survey of 
X-ray luminous galaxy clusters at 0.15 < z < 0.3. The over- 
all aim of the survey is to measure the cluster-cluster scatter in 
key observables such as the X-ray temperature, and Yx par ame- 
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the o bscured and unobscured star formation activity JHaines et al.l 
l2009ab. 2010; Smith etal. 2010; Pereiraetal. 201(|), and to cor- 
relate this scatter with the structure and thus hierarchical assembly 
history of the clusters. The back bone of the survey is the gravi- 
tationa l lensing analysis of HST JSmith et al.ll2005l ; iRichard et al.l 
uOlO b. Hamilton-Morris et al. in prep.; May et al. in prep.) and Sub- 
aru lOkabe et al. 2010; Oguri et al. 2010) imaging data, because the 
lensing-based mass maps can be used to inf er the likely assembly 
history of the clusters CSmith & Tavloj|2008l) . 

The parent sample for this study comprises 115 clusters sat- 
isfying -27°<(5<70°, 0.15<2<0.3, ng<7 xl0^°cm~^ drawn 
from the ROSAT All-sky S urvey catalogs ( Ebeling et all 1 19981 . 
I2OOOI ; iBohringer etai]|2004) . The cut at S = -27° ensures that 
the clusters are observable from Palomar Observatory at elevations 
above 30° (an airmass of sec 2 < 2). The clusters span a decade 
in X-ray luminosity in the 0.1 - 2.4keV band: 2 x lO""' <. Lx <, 
20 X lO^^ergs"^ (Fig. [Hi, which co rresponds to a mass range of 
5 X 10^* < A/viriai < 3 X 10^^ M0 JReiprich & B6hringeij|2002l) 
- i.e. well-matched to 10^^ M©. 

The radius at which the mean enclosed density of a 10^^ Mq 
dark matter halo at the median redshift of the cluster sample (z = 
0.22) is (p(< r)) = 200pcrit is r2oo = 1.9Mpc. To ensure that 
our results are comparable with previous studies of the luminosity 
gap statistic, data that probe out to ^ 0.5r2oo (^ 4.5 arcmin) are 
required. This requirement is met by the Wide-field Infrared Cam- 
era (WIRC) on the Hale 200in Telescope at Palomar Observatory 
( fjl.H . Traditionally, the luminosity gap statistic has been studied at 
optical wavelengths. In contrast, working in the near-infrared per- 
mits the use of (J — K) colours as a surrogate for a photometric 
redshift estimate of cluster galaxies ( i|3.1l ). taking advantage of the 
relative insensitivity o f near-infrared colours to spectral type (e.g. 
iMaimucci et al.ll200lh . This is vital, in the absence of exhaustive 
spectroscopic catalogs, to weed out non-cluster members when cal- 
culating the luminosity gap statistic. 

2.2 Ground-based Near-infrared Data 

The parent sample of 1 15 clusters were used as a back-up observ ing 
program during observing runs with WIRC JWilson et alj2003h on 
the Hale 200in Telescopq^ during observing runs spanning April 

^" The Hale Telescope at Palomar Observatory is owned and operated by 
the California Institute of Technology. 



^ o 
^ X 

CN CN| 

T— in 

o o 



T 



®* o 
® ® 



®3® 
I® ®® ® 

Ot» 

o «# » 

3 ® 

dh ^ o 



h8 



-® & -e - 



0.15 



0.2 



0.25 



0.3 



Redshift 



Figure 1. The distribution of the parent sample of 115 clusters in the 
Lx —redshift plane. Filled data points indicate clusters that we observed in 
acceptable conditions (see i|2.2t with WIRC on the Hale 200-in telescope, 
and are colour-coded as follows: black - also observed with both HST and 
Chandra; blue - also observed with HST, but not with Chandra; green - 
also observed with Chandra, but not with HST; red - observed with nei- 
ther HST nor Chandra. Open data points were not observed with WIRC at 
Palomai; and therefore do not form pail of the sample studied in this article. 
The dashed lines delineate the volume limited samples against which the 
observed sample is compared statistically in i|2.5| The absence of clusters 
from the parent sample to the lower right is caused by the flux-limit of the 
ROSAT All-sky Survey. 
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Figure 2. The (J — K)/ K colour-magnitude relation for A 1763. The hor- 
izontal lines show the region within which likely cluster galaxies were se- 
lected. The filled circle denotes the BCG. 



2004 to luly 2005. Data were acquired when the full width half 
maximum (FWHM) of point sources exceeded 1 arcsec. In total, 
data were obtained on 78 clusters, with no pre-selection on cluster 
properties other than the X-ray selection described above (i ]2.1| (. 
Each cluster was observed with a single 8.7' x 8.7' WIRC point- 
ing. BCGs at 2:~0.2 have a typical angular extent of ~larcinin; 
the individual exposures were therefore dithered within a box of 
full-width 80" to minimise inclusion of BCG flux in sky-flats con- 
structed from the science data. Each cluster was observed for a total 
of 600 sec per filter, split into 5 dither positions. 

The data were reduced in a uniform and standard manner us- 
ing an automated pipeline of IRAF tasks to dark subtract, flat-field, 
align, and co-add the individual frames at the telescope. Data ac- 
quired in conditions worse than FWHM = 1.5" suffered strongly 
variable transparency and/or non-uniform background, and were 
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Table 1. The observed sample of clusters. 



Cluster 


a, 5 [J2000] 


Redshift 


Mk,bcg'' 


Ami2°- 


HST PID 


Also known as 


A 68 


00 37 05.28 +09 09 10.8 


0.255 


-26.65 


0.25 


8249 




A115 


00 55 50.65 +26 24 38.7 


0.197 


-26.29 


0.39 


11312 




A141 


0105 37.17-24 40 49.7 


0.230 


-26.54 


0.40 


10881 


RXCJ0105.5-2439 


ZwCl 0104.4+0048 


0106 49.50+0103 22.1 


0.255 


-26.37 


0.52 


11312 


Z348 


A 209 


013153.00-13 36 34.0 


0.206 


-26.76 


0.87 


8249 


RXCJ0131. 8-1336 


A 267 


0152 48.72+0101 08.4 


0.230 


-26.56 


1.43 


8249 


RXCJ0152.7+0100 


A 291 


020143.11 -02 1148.1 


0.196 


-25.49 


1.79 


8301 


RXCJ0201. 7-0212 


A 383 


02 48 02.00 -03 32 15.0 


0.188 


-26.24 


1.76 


8249 


RXCJ0248.0-0332 


RXCJ0331. 1-2100 


03 3105.87-2100 32.7 


0.188 


-27.07 


0.96 


10881 




A521 


04 54 06.88-10 13 24.6 


0.247 


-26.47 


0.00 


11312 


RXCJ0454. 1-1014 


A 586 


07 32 20.42+3137 58.8 


0.171 


-26.31 


0.51 


8301 




ZwCl 0740.4+1740 


07 43 23.16+17 33 40.0 


0.189 


-26.26 


1.62 


11312 


Z1432 


A611 


08 00 55.92 +36 03 39.6 


0.288 


-26.90 


1.28 


9270 




A 665 


08 30 57.36+65 51 14.4 


0.182 


-25.81 


0.66 






ZwCl 0839.9+2937 


08 42 56.06 +29 27 25.7 


0.194 


-26.45 


0.67 


11312 


Z1883 


ZwCl 0857.9+2107 


09 00 36.86 +20 53 40.0 


0.235 


-25.82 


0.34 


8301 


Z2089 


A 750 


09 09 12.74+10 58 29.1 


0.163 


-26.52 


0.95 


11312 




A 773 


09 17 54.00+5142 57.6 


0.217 


-26.68 


0.47 


8249 




ZwCl 0923.6+5340 


09 27 10.69 +53 27 30.9 


0.205 


-25.86 


0.55 




Z2379 


ZwCl 0949.6+5207 


09 52 47.52+5153 27.6 


0.214 


-26.45 


1.24 


8301 


Z2701 


A 901 


09 56 26.40-10 04 12.0 


0.163 


-25.90 


0.08 


10395 


RXCJ0956.4-1004 


RXJlOOO.5+4409 


10 00 31.16+44 08 42.5 


0.153 


-25.72 


0.35 


10881 




A 963 


10 17 01.20 +39 01 44.4 


0.205 


-26.42 


1.73 


8249 




A 1201 


11 12 54.61+13 26 08.2 


0.169 


-26.25 


1.37 


8719 




A 1204 


11 13 20.55+17 35 39.1 


0.171 


-25.80 


0.95 


8301 




A 1246 


1123 58.83+2128 45.4 


0.190 


-25.92 


0.42 


8301 


RXCJl 123.9+2129 


A 1423 


1157 17.43+33 36 38.6 


0.213 


-26.16 


1.77 


8719 




A 1553 


12 30 48.95+10 32 45.6 


0.165 


-26.82 


0.94 






ZwCl 1231.4+1007 


12 34 17.45+09 45 58.1 


0.229 


-26.21 


0.71 


8719 


Z5247 


A 1634 


12 54 01.84-06 42 14.4 


0.196 


-25.82 


0.61 




RXCJ1254.0-0642 


A 1682 


13 06 47.89 +46 33 32.5 


0.226 


-26.96 


0.09 


8719 




ZwCl 1309.1+2216 


13 1146.15+22 0136.8 


0.266 


-26.18 


2.17 




Z5768 


A 1704 


13 14 24.38+64 34 31.0 


0.220 


-26.34 


1.41 






A 1758 


13 32 44.47 +50 32 30.5 


0.280 


-26.26 


0.22 






A 1763 


13 35 16.32 +40 59 45.6 


0.228 


-26.84 


1.59 


8249 




A 1835 


14 01 02.40 +02 52 55.2 


0.253 


-27.32 


2.44 


8249 




A 1914 


14 25 59.78+37 49 29.1 


0.171 


-26.62 


1.33 


8301 




A 1961 


14 44 31.85+31 13 34.3 


0.234 


-26.47 


0.31 






A 1994 


14 56 13.48 -05 48 56.6 


0.220 


-26.45 


0.55 




RXCJ1456.3-0549 


MS 1455.0+2232 


14 57 15.23 +22 20 34.0 


0.258 


-26.06 


0.11 


8301 


ZwCl 1454.8+2233, Z7 160 


A 2009 


15 00 19.63 +21 22 08.9 


0.153 


-25.99 


0.17 


8301 




ZwCl 1459.4+4240 


15 0123.13+42 20 39.6 


0.290 


-26.28 


0.07 




Z7215 


A2111 


15 39 40.51 +34 25 27.0 


0.229 


-25.58 


0.37 






A 2146 


15 56 09.05+66 2133.1 


0.234 


-26.07 


0.41 


8301 




A2163 


16 15 34.10-06 07 26.0 


0.169 


-25.46 


0.49 






A 2204 


16 32 46.94+05 34 31.3 


0.152 


-25.82 


0.11 


8301 




A2218 


16 35 52.80 +66 12 50.4 


0.171 


-26.12 


0.32 


5701 




A2219 


16 40 22.56+46 42 21.6 


0.228 


-26.62 


1.18 


6488 




A 2254 


17 17 45.96+19 40 48.0 


0.178 


-26.05 


0.88 


8301 




RXJ1720.1+2638 


17 20 10.14+26 37 30.9 


0.164 


-26.37 


1.76 


11312 




A 2261 


17 22 27.24 +32 07 56.7 


0.224 


-26.34 


2.32 


8301 




RXCJ2102.1-2431 


2102 09.98-24 32 01.8 


0.188 


-26.82 


2.04 






A 2345 


2127 13.73-12 09 46.1 


0.176 


-26.67 


1.09 


11312 


RXCJ2127. 1-1209 


RXJ2129.6+0005 


2129 40.02+00 05 20.9 


0.235 


-26.78 


1.93 


8301 




A 2390 


2153 36.72+17 41 31.2 


0.233 


-26.21 


1.50 


5352 




RXCJ221 1.7-0350 


22 1145.95-03 49 45.3 


0.270 


-26.35 


1.71 






A 2485 


22 48 31.13-16 06 25.6 


0.247 


-26.59 


0.00 


11312 


RXCJ2248.5-1606 


A 2537 


23 08 23.20-02 1131.0 


0.297 


-26.19 


0.63 


9270 


RXCJ2308.3-2011 


A 2631 


23 37 39.82 +00 16 16.9 


0.278 


-26.35 


0.64 


11312 


RXCJ2337.6+0016 



" Uncertainties on M^ bcg ^nd Ami2 are dominated by the uncertainties on the photometric calibration, which is ~ 0.1 mag in J- and K-bands. 
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therefore excluded from the analysis, leaving a total of 59 clus- 
ters with good quality data (TablelTJ. Astrometric and photometric 
calibration were achieved by reference to the 2MASS catalogs, to 
root mean s quare (rms) preci sions of 1 arcsec and 0.1 magnitudes 
respectively dStott et alJuOOSl) . The results described in this article 
are insensitive to the uncertainty on the photometric calibration. An 
example {J—K)/K colour magnitude diagram is shown in Fig.[2] 
The typical depth reached by the data is A'~17; an L* galaxy has 
A'^IS and a typical BCG has i^~12-13 at z~0.2. 



2.3 Hubble Space Telescope Observations 

Hubble Space Telescope (HSTr^ imaging data are available 
through a broad red filter (F606W, F702W, and/or F814W) for 45 
of the 59 clusters (Table [T]( of which 13 are drawn from new Lo- 
CuSS ACS (PID:10881) and WFPC2 (PID:11312) observations. 
The reduction of the data on 10 clusters o bserved unde r PID:5 701, 
PID:6488 and PID:8249 is described by ISmith et al] ( l2005h . Of 
the remaining 36 clusters, the 18 with WFPC2 data (PIDs:5352, 
8301, 8719, 11312) were all reduced onto a 0.1" pixel scale us- 
ing WFIXUP, WMOSAIC, IMSHIFT and CRREJ tasks within IRAF 
to clean, register and combine the individual exposures. Details of 
the reduction of the remaining clusters observed with ACS are de- 
scribed by Hamilton-Morris et al. (2010, in preparation). 

2.4 Chandra X-ray Observations 

Chandra X-ray observations are available for 41 of the total sam- 
ple of 59 clusters. Th e reduction and analys is of these data are 
described in detail by ISanderson et alj J2009f) . In brief, for each 
cluster, an annular spectral profile was extracted and used to de- 
project the X-ray emission to measure the gas density and tem- 
pe rature in spherical shells. T he phenomenological cluster model 
of lAscasibar & Diegd ( 120081) was then jointly fitted to the tem- 
perature and density profiles to determine the mass profile, assum- 
ing hydrostatic equilibrium, following the procedures d escribed in 
I Sanderso n & PonmanI 1 120101) . The model is based on a Hemquisj 
( 1990) density profile, which yields larger scale radii (and corre- 
spondingly lower mass concentrations) than th e commonly-used 
NFW profile. Following ISanderson et al.l ( 120091) , we also use the 
logarithmic slope o f the gas density profile at 0.04 rsoo (a; 
IVikhlinin et al.ll2007l) as an indicator of cool core strength, which 
has also been shown to correlate with the s ubstructure fraction o f 
cluster cores, based on strong lens models jRichard et alj|2010bh . 
A more negative value of a indicates a steeper central gas density 
profile, and thus a stronger cool core, and vice versa. 



2.5 Statistical Comparison of Sub-samples 

Incomplete coverage of the parent sample of 115 clusters with 
WIRC, and heterogeneous coverage of the WIRC-observed clus- 
ters with other facilities (Fig. [T]! may introduce subtle biases into 
our results. We therefore compare statistically the various observed 
sub-samples, including for completeness the sub-sample for which 
Spitzer data are available ( i]3.2t . Specifically, the cluster X-ray lu- 
minosities are compared, after correction for the modest redshift 



^^ Based on observations with the NASA/ESA Hubble Space Telescope 
obtained at the Space Telescope Science Institute, which is operated by the 
Association of Universities for Research in Astronomy, Inc., under NASA 
contract NAS 5-26555. 



Table 2. Statistical Comparison of Sub-samples 



Sample 



•^Vclu 



(logio(Lx))'' 



All clusters observed with WIRC 
Clusters observed with WIRC & HST 
Clusters observed with WIRC & Chandra 
Clusters observed with WIRC & Spitzer 
Mean of 100,000 samples drawn randomly 
from volume-limited sample 



59 


44.71±0.03 


45 


44.71±0.03 


41 


44.76 ±0.04 


39 


44.73 ±0.03 


59 


44.71±0.03 



" The uncertainties are errors on the mean X-ray luminosity of each sample, 
with the exception of the last row, in which we quote the standard deviation 
of the 100,000 samples around the mean luminosity of all of these randomly 
drawn samples. 

evolution within the sample due to the expansion of the universe: 
Lx,. = LxE(z)-''\ whsisE(z) = H{z)/Ho = [f^M(l + 
zf + f^A]"-^ foUowing lEvi-ard et al.l ( l20o3) . 

The mean X-ray luminosity of the full sample of 59 clus- 
ters is statistically indistinguishable from the mean luminosity of 
the sub-samples observed at other wavelengths (Table|2]l. We also 
draw 100,000 samples of 59 clusters at random from the com- 
bined volume-limited samples defined by 0.15 < z < 0.2, 2.2 x 
10''''ergs"^ < Lx,z < 4.2 x 10'"'ergs-\ and 0.15 < z < 0.3, 
Lx,z > 4.2 X 10'''*ergs~^ (see Fig.[T). The average X-ray lumi- 
nosity of the observed sample of 59 clusters is well within one stan- 
dard deviation of the average X-ray luminosity of these randomized 
samples (Table|2j. We therefore conclude that the Lx distributions 
of the full sample of 59 clusters, the sub-samples observed with 
other telescopes, and the volume limited-sample defined above, are 
all statistically indistinguishable from each other. We therefore ex- 
pect any biases to be negligible, and that our results can be treated 
as comparable with those that would be achieved with a volume- 
limited sample. We also take care to double check that the Ami2- 
distributions of the various observational sub-samples are statisti- 
cally indistinguishable from each other in ij3] 



3 ANALYSIS AND RESULTS 

3.1 Source Detection and Pliotometry 

The J- and JC-band fram es were analyzed with SExtrac- 
tor ( Berlin & AmoutsI 119961) , extracting all objects subtending 
>25pixel at S/N > 2.5pbc;el~'^. The resulting catalogs were 
matched using a search radius comparable with the seeing disk, and 
point sources were excluded based on the stellarity index calculated 
by SExtractor. In the absence of spectroscopic redshift information 
we rely on the red ridge line of galaxies seen in the (J — K)/K 
colour-magnitude diagrams for each cluster (e.g. Fig. |2j to iso- 
late likely cluster galaxies. A s imple model based on redshifting 
local galaxy spectral templates dKing & EUiJI 19851) confirms that 
the (J — K) colour of galaxies varies by <. 0.2mag between 
E/SO and Scd spectral types. We therefore selected galaxies within 
±0.2 magnitudes of the BCG colour in each cluster as likely cluster 
members (see horizontal lines in Fig.|2j. 

The extended envelope of the BCGs typically spans a diam- 
eter of ~ larcmin in the HST frames. In contrast, BCGs typi- 
cally span just ~ 20 — 30" in the near-infrared frames. The dif- 
ference is due to the brighter sky in the near-infrared relative to 
the optical. We therefore use the deep F 702W HSTfWFPC l data 
available for 10 clusters in our sample dSmith et al.ll2005b to es- 
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Figure 3. Distribution of the observed luminosity gap (black points 
and solid line). The gray filled histogram is the expected distribution if 
the galaxies a re dra wn at random from a Schechter function following 
iDariush et al] i2007h (see 3X2] for more details). The dashed, dot-dashed, 
and dotted histograms show the Am,i2-distributions of the sub-samples of 
clusters for which HST, Chandra, and Spitzer data ai'e available. 



timate the K-b&nA flux lost due to the bright K-bwiA sky, un- 
der the assumption that the (i?702 — K) colour of BCGs does 
not vary significantly with radius on large scales. This assump- 
tion introduces negligible systematic uncertainty into our results 
because colour gradients in elliptical galaxies are measured to be 
d{R - K)/d{\ogr) - 0.3 - 0.4 (LaBarberaet al. 2004, 2010), 
which translates into a possible ~ 0.1 magnitude systematic er- 
ror on the factor of 2 radial corrections to the A'-band photometry 
estimated below. 

After masking out other galaxies from the data, the BCG 
R702- and ii'-band light distributions are modelled using ELLIPSE 
in the STSDAS package in IRAF. The i?702-band model is then used 
to extrapolate the A"-band light distribution out to 2a above the 
mean local background. The same procedure was applied to a sam- 
ple of L* galaxies detected in the WFPC2 frame of each of these 
ten clusters. This analysis revealed that reliance on solely A'-band 
data causes the the total flux of BCGs to be under-estimated by 
~ 0.3 — 0.7mag, with a median of ~ 0.45mag. This effect is much 
less severe for non-BCG's, with total flux being under-estimated by 
~ 0.07 — 0.15mag, with a median of O.lmag. We fit a straight- 
line to these data: AK — a + 13 K, obtaining a = —1.67 ± 0.43 
and /3 = 0.11±0.03. The correction, A K, was then applied to 
all galaxies within our sample. The amplitude of this systematic 
correction to the luminosity gap statistic measurements is therefore 
/3Ami2 and is typically in the range ~ — 0.3 mag with an uncer- 
tainty of ~ 25%, both of which are smaller than the bin-width in 
our subsequent analysis. Our results are therefore not significantly 
affected by the uncertainties on this correction. 



3.2 Luminosity Gap Statistic of 10^"" M0 Clusters 

In the absence of models of the mass distribution, and thus mea- 
surements of r2oo for all clusters in the sample we adopt a fixed 
projected physical radius of i? = 640kpc within which to calcu- 
late Am 12 for each cluster. This aperture fits comfortably within 
the observed field of view for all clusters, and corresponds to 



~ 0.4r2oo for a Afviriai ^ 10^ Mq cluster at z = 0.2. The distri- 
bution of the luminosity gap statistic is shown in Fig.|3] p( Ami2) is 
a declining function of Ami2. We therefore fit a straight-line to the 
data: p(Ami2) = A + BAmi2, weighting the data-points by a^^ 
where a is the Poisson uncertainty on A7ni2 in each bin. The best- 
fit parameter values ai'e: A = 0.36 ± 0.03 and B = -0.13 ± 0.02. 
We also measure the fraction of "fossil clusters": a total of 4 clus- 
ters have A7TI12 > 2, yielding a fraction of 10^^ M© clusters satis- 
fying this selection of p(Ami2 > 2) = 0.07J;gQ3, where the error 
bar is at Icr usi ng binomial sta t istics (Gehrels 1986). 

Following l Oariush et alj ( 120071) we also show in Fig. [3] the 
Ami2 distribution derived from a Monte Carlo simulation in which 
galaxies were drawn at random from a Schechter function with 
M* = —24.5 and q = —1.2, adopted from a fit of the Schechter 
function to the A'-band galaxy luminosity function of the Millen- 
nium semi-analytic catalo gue, and is also consistent with observed 
luminosity functions (e.g. iLin et al.ll2004l) . This simulation allows 
us to identify whether the Ar7ii2 distribution presents any excess 
probability over random statistical sampling of a common under- 
lying luminosity function. Excess probability over random is only 
found at Ami2 ^ 1. We measure the observed probability of a 
cluster to have a luminosity gap of Am,i2 > 1 to be p(Am,i2 > 
1) = 0.37 ± 0.08, compared with the estimated probability based 
on the Monte Carlo simulation of pMc(Ami2 > 1) = 0.13. 
We therefore detect an excess probability over random sampling 
at Ami2 > 1 of '^ 0.24 at ~ 3a significance, and conclude that 
the A7ni2 distribution at Amn > 1 has a physical origin. 

The Ami2 distributions of the sub-samples of clusters for 
which HST, Chandra, and Spitzer data are available are statistically 
consistent with that of the full sample of 59 clusters (Fig.|3]l. Two 
sample Kolmogorov-Smirnov (KS) tests that compare the HST, 
Chandra, and Spitzer sub-samples in turn with the full sample con- 
firm that the probability of the respective sub-samples being drawn 
from a different underlying Ami2 distribution than the full sample 
is P < 0.1% in all cases, with the largest difference between the 
cumulative distributions being D = 0.0748, between the Chandra 
sub-sample and the full sample. 

We also look at how the absolute magnitude of the first and 
second ranked cluster galaxies vary with A7ni2 (Fig. |4j. The lu- 
minosity of the first ranked galaxy increases very slowly with 
A7ni2, remaining in the range —27 <^ Mk <, —26 across 
the full range of Ami2. In contrast, the luminosity of the sec- 
ond ranked galaxy declines from Mk ~ —26 at A7ni2 ~ to 
]^Ik ~ —24 at A7ni2 ~ 2. We characterize these trends by fit- 
ting the following relations to the data: Mk,i — ai + l3iAmi2 
and Mk,2 = 02 + /32Ami2, where the numerical subscripts de- 
note the first and second ranked galaxies respectively. The best- 
fit values are: qi = -26.13 ± 0.02, /3i = -0.22 ± 0.02, and 
Q2 = -26.13 ± 0.02, /32 = 0.78 ± 0.02. Empirically large lumi- 
nosity gap statistics are therefore due to both an over-bright BCG, 
Afx,i(Ami2=2) — MK,i(Ami2=0)~— 0.4, and an under-bright 
second ranked galaxy, MK,2(A7ni2=2)— Mk,2(A771i2=0)~1.6. 
The relative faintness of second ranked galaxies in large luminosity 
gap clusters supports the idea that the growth of dominant BCGs is 
driven by the merging of luminous cluster galaxies with the BCG. 
Indeed the current SFR of BCGs discussed above lends additional 
support - the BCG in a cluster with a luminosity gap of Ami2 = 2 
is 6x more luminous and has a stellar mass of ~ 10^^ M© 
more than the second ranked galaxy. Just two of the four clus- 
ters with A7ni2 > 2 in Fig. |5] host an active BCG. The most ac- 
tive of these A 18 35, is forming stars at SFR = 125Mq yr"^ 
l lEgami et alj|2006l) , and the other, RXJ 2129.6+0005, is forming 
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Figure 4. Absolute K-bsmd magnitude of the first (black circles) and sec- 
ond ranked (red triangles) galaxies as a function of luminosity gap. The 
solid black and red lines show the best-fit straight-line to the data - see 
^3.2l for more details. The horizontal dashed lin e is at M^ = — 24.34, the 
absolute magnitude of an L* galaxy, taken from lLin et alj (2004|). 



stars at SFR = 14M0 yr"^ l lOuillen et al.l2008h . These two BCGs 
would therefore have to form stars continuously at this rate for 
~ 10^ and ~ lO'^" years respectively for their large luminosity 
gap to be caused exclusively by gas cooling and consequent star 
formation. 

Finally, we note that on average second ranked galaxies in 
clusters with Arni2 > 2 have {Mk;i) = —2 4.6 ± 0.6 where 
the uncertainty is the rms scatter around the mean. lLin et al.l 120041) 
measured A/a' = — 24.34±0.01 for L* cluster galaxies at z < 0.1, 
in agreement wi th similar studies of field galaxies and of higher 
redshift clusters JDe Propris et alJl999l : ICole et alj200il) . The dis- 
tribution of luminosities of second ranked cluster galaxies in clus- 
ters with Ami2 > 2 is therefore statistically consistent with them 
being L* galaxies. This contrasts with low mass Ami2 > 2 sys- 
tems, i.e. fossil groups, in that L* galaxies are absent from low 
mass systems. This difference is probably due, at least in part, to 
the relative inefficiency of galaxy merging in massive clusters. 



3.3 Comparing Luminosity Gap with Cool Core Strength 

To explore further the physical origin of large luminosity gaps 
we plot Ami2 versus a, the slope of the logarithmic gas den- 
sity profile at 0.04r5oo, for 41 clusters that have also been ob- 
ser ved with Chandr a in Fig . \5\ The measurements of a are based 
on ISanderson et al.l 's ( 120091) analysis of the Chandra data ( i]2.4| l. 
At Ami2 — the clusters span the full range of cool core 
strengths: —1.2 <^ a <^ —0.1. This dynamic range shrinks to just 
— 1.2 < a < —0.6 at Ami2 > 2 ~ the clusters with large lu- 
minosity gaps also host relatively strong cool cores. We also iden- 
tify star-forming BCGs in Fig. [5] It has long been known that Ha 
emission from the BCG is closely associated with the presence of 
significant central co oling in the clu ster core (e.g. iHeckman 1981; 
ICrawfordetal.ll999h . More recentlv. lSanderson et al.l ( l2009l) found 
in their sample of 65 clusters that Ha emitting BCGs occur ex- 
clusively in those clusters with the most cuspy inner gas density 
profiles (a < —0.85), and where the projected offset between the 



Am.|2 

Figure 5. The gradient of the logarithmic gas density profile at 0.04r5oo 
versus luminosity gap f or 41 clusters that have also been observed with 
Chandra and studied by San derson et alJj 2009). Blue stars correspond to 
clusters with an Ha emitting BCG (see lSanderson et al. 200 9): blue stars 
with a black outline have also been identified as hosting a BCG that is form- 
ing stars at SFR > IOMq yr~^ using Spi'ter/MIPS observations; filled red 
circles denote clusters with BCGs that are not Ha emitters and are forming 
stars at SFR < IOMq yr~ ^ ; open black circles indicate clusters that have 
not been observed with Spitzer. 



X-ray centroid and the BCG is < 0.02r50()- The same is true of 
the five BCGs with star formation rates (SFR) of > IOM0 yr~^, 
based on mid-infrared observations with 5p(?zer/MIPS - this SFR 
corresponds to a flux of ~ ImJy from a BCG at z ~ 0.2. These 
measurements are drawn from the literature JEgami et al J 120061 ; 
lOuillen et al J 120080 and our own measurements using data from 
Cycle 4 (PID:40827, PI: Smith; PID: 41011, PI: Egami) the de- 
tails of which will be published elsewhere (Egami et al., in prep.). 
Fig. [5] therefore confirms that cool core clusters tend to host ac- 
tively star-forming BCGs (e.g. lEdge et al.ll999l ; lE"gami et al.l20"o^; 
lOuillen et alJuOOSi) . However, cool core clusters (a <^ —1) with 
active BCGs (SFR ^ lOM© yr~^, and/or Hq emission) are found 
across the full range of A7ni2 in Fig. [5] 

These results are consistent with the interpretation of large lu- 
minosity gap clusters as objects that formed relatively early, and 
subsequently developed a large luminosity gap through the merg- 
ing of bright cluster galaxies with the BCG. A similarly long pe- 
riod of time - a few Gyr - is required to form a strong cool core 
following cluster formation. Conversely, if all clusters with smaller 
luminosity gaps formed more recently than those with larger gaps, 
and thus have had insufficient time to form a large luminosity gap 
and a cool core, then they should all host relatively weak cool cores. 
However this is not the case. This can be understood if the so-called 
"fossil" status of a large luminosity gap cluster is not the end-point 
of its evolution. If bright (L > L*) galaxies fall into a cool core 
"fossil" cluster, then that cluster would move immediately leftward 
from the bottom right of Fig. |5] As the in-falling system (presum- 
ably a group) reaches the cluster core ~ IGyr later, it may disrupt 
partially or fully the cooling of gas onto the BCG, and cause the 
cluster to move vertically in the Ami2 — a plane. This scenario 
naturally explains the triangular distribution of points in Fig.|5] and 
is consistent with hierarchical infall (i.e. mergers) playing a role in 
regulating cooling in cluster cores. 
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Figure 6. Example isophotal shape profiles. From left to right the BCGs are classified as boxy, disky, pure ellipse and unclassified. The vertical line at the 
left is set to 3x the FWHM of point sources and the one at the right indicates the half-light radius of the BCG. Note that to keep the analysis simple and 
conservative, no flux was ma sked out of the HS T data. So, for example, the BCG in A 1201 was unclassified because of the impact of the gravitational arc at a 
BCG-centric radius of ~ 2" JEdge et alj2003h on the isophotal analysis. 



To place this discussion on a more quantitative footing we 
adopt a strategy that we return to often in ij3]- we split the sample 
into low- (Ami2 < 1) and high-Ami2 {Ami2 > 1) sub-samples 
and perform a two sample KS test on the cumulative distribution of 
the other variable, in this case a. The hypothesis that high-Ami2 
clusters are drawn from the same underlying a distribution as low- 
Ami2 clusters is rejected at just 74% confidence, i.e. slightly over 
Icr significance, based on a maximum difference between the cu- 
mulative a distributions of D = 0.3102. In the absence of a de- 
cisive test, we therefore divide the sample at Ami2 = 1.5, i.e. a 
more extreme value of A?ni2, attempting to identify roughly the 
luminosity gap at which the a distribution diverges from that of 
lower-Ami2 clusters. This time the two sample KS test rejects the 
null hypothesis at 95% confidence - i.e. 2cr - based on a maxi- 
mum difference between the respective cumulative a distributions 
of_D = 0.472. 



3.4 Comparing the Luminosity Gap with BCG Morphology 

We use the high angular resolution HST imaging observations of 
the 45 clusters discussed in i]2.3l to measure the isophotal shape of 
the BCGs in these clusters. The ELLIPSE task in IRAF was used 
to measure the fourth Fourier coefficient (^4) of the light distri- 
bution. This c oefficient indic ates whether the galaxy has a disky 
or boxy shape (Bender 1988). In Fig. [6] we show the B4 profile of 
four BCGs to illust rate the diversity within the sample. Following 
ISender et alj ( 119890 we tried to use the extremum value of B4, (i.e. 
S4,ext in Table[3]l to classify galaxies as either disky (i34,cxt > 0) 
or boxy (i34,oxt < 0). If the B4, profile passes through a station- 
ary point, then the extremum is obtained by finding the maximum 
or minimum value of Bi in the radial range enclosed by 3x the 
FWHM of point sources and the effective radius derived from a 
de Vaucouleurs profile fit. In the absence of a stationary point, the 
extremum value of B4, is the value at the effective radius, under 
the assumption that B4 is a monotonic function of radius. However 
i?4 is in general not a monotonic function of radius for BCGs in 
our sample, even for those with isophotes that have, on average, 
boxy and disky isophotes (Fig.|6](. For these reasons, the isophotal 
shapes of 22 out of 45 BCGs cannot be classified based on i34,cxt- 
We also find some clusters (e.g. A 1204 - see Fig. |6f in which B4, 
is consistent with zero across the full radial range of the data. 

We therefore implement a modified scheme, in which we cal- 
culate the error- weighted mean value of ,84 in the same radial range 



as above, with no weighting of the bins to account for the vari- 
ation of the bin solid angle as a function of radius. BCGs with 
{B4) consistent with zero within the uncertainties were classified 
as elliptical, otherwise BCGs are classified as Boxy or Disky if 
(B4) <0 or {Bi) >0 respectively. Finally, a BCG is "Unclassified" 
if the error on (B4) is comparable with the dynamic range of the 
data, i.e. >0.01. BCG morphologies derived under both Bender et 
al.'s "extremum" scheme and our own "mean" scheme are listed 
in Table [3] along with the Boxy /Disky /Elliptical/Unclassified clas- 
sification based on each method. The respective methods agree on 
morphological classification for 16 of the 22 BCGs for which clas- 
sification was possible under both methods. However, only three 
of the six discrepant BCGs have (B4) and B4,cxt values that for- 
mally disagree between the methods within the quoted uncertain- 
ties - A 521, A 750 and ZwCl 0949.6+5207. The important advan- 
tage of our method is that classification is possible for an additional 
15 BCGs that were unclassifiable under the Bender et al. scheme. 
We therefore adopt (B4) as our measure of BCG morphology for 
all clusters with HST data for the reasons outlined above regarding 
the general absence of clearly defined stationary points and mono- 
tonic behaviour of the B4 profiles. 



In summary, out of 45 clusters, 10 are classified as Boxy, 13 as 
Disky, 14 as Elliptical, and 8 are Unclassified. In Fig|7]we plot {B^) 
versus Ar7ii2, the most striking feature of which is the lack of clus- 
ters with large Ami2 and negative (^4 ) , i.e. boxy BCGs appear not 
to live in large luminosity gap clusters. As in i]3.3l we split the clus- 
ters into low-A77ii2 (A7ni2 < 1) and high-A7Tii2 (Ami2 > 1) 
samples and perform a two-sample KS test. The low- and high- 
Ami2 samples contain 27 and 18 clusters respectively, with a max- 
imum difference between their cumulative (54) -distributions of 
D — 0.3567. The hypothesis that the low- and high-Ami2 samples 
are drawn from the same underlying (-B4) distribution is therefore 
disfavoured at 91% confidence, i.e. 1.7a. Unlike the situation for 
the analysis of the a distributions of high- and low-Ami2 clusters 
in i]3.3l the significance with which the null hypothesis is rejected 
does not increase if the sub-samples are re-defined by splitting the 
full sample at Ami2 = 1.5. This is obvious from a comparison of 
Figs.|5]&|7] and suggests that the relationship between A?ni2 and 
BCG morphology is stronger than between Ami2 and cool core 
strength. 
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Table 3. Results from Isophotal Analysis of Brightest Cluster Galaxies using HST data. 



Cluster 


Effective 
radius 


Extremum Method 

100i34.cxt Classification 


Mean Method 




100 (B4> 


Classification 




(arcsec) 










A 68 


7.9 


-2.0+0.4 


Boxy 


-0.99+0.19 


Boxy 


A115 


4.1 




Unclassified 


+0.96+0.22 


Disky 


A 141 


3.6 


+3.4+0.9 


Disky 


+1.09+0.19 


Disky 


ZwCl 0104.4+0048 


4.1 




Unclassified 


-0.14+0.80 


Elliptical 


A 209 


7.4 


+1.8+0.5 


Disky 


+0.45+0.07 


Disky 


A 267 


11.6 


+3.5+1.2 


Disky 


+0.99+0.17 


Disky 


A291 


7.3 


+2.1+0.6 


Disky 


+0.67+0.24 


Disky 


A 383 


8.3 




Unclassified 


-0.20+0.97 


Unclassified 


RXCJ0331. 1-2100 


3.1 




Unclassified 


-2.69+7.50 


Unclassified 


A 521 


3.9 


-3.1+1.0 


Boxy 


-0.44+0.79 


Elliptical 


A 586 


7.7 




Unclassified 


-0.16+0.64 


Elliptical 


ZwCl 0740+ 1740 


5.5 


+0.1+0.2 


Elliptical 


-0.11+0.06 


Boxy 


A611 


3.5 




Unclassified 


-0.21+0.15 


Boxy 


ZwCl 0839.9+2937 


3.5 




Unclassified 


+0.84+0.50 


Disky 


ZwCl 0857.9+2107 


4.6 


-4.0+1.1 


Boxy 


-1.49+0.61 


Boxy 


A 750 


4.0 


+5.0+1.5 


Disky 


+0.11+1.45 


Unclassified 


A 773 


6.8 


-2.1+0.4 


Boxy 


-0.50+0.18 


Boxy 


ZwCl 0949.6+5207 


7.1 


+2.4+0.7 


Disky 


+0.22+0.30 


Elliptical 


A 901 


3.2 


+1.1+0.3 


Disky 


+0.50+0.10 


Disky 


RXJlOOO.5+4409 


3.1 




Unclassified 


-0.77+0.18 


Boxy 


A 963 


14.3 




Unclassified 


+0.46+0.19 


Disky 


A 1201 


10.8 




Unclassified 


-0.26+0.48 


Elliptical 


A 1204 


8.9 


0.0+0.7 


Elliptical 


-0.10+0.21 


Elliptical 


A 1246 


8.6 




Unclassified 


-0.14+1.78 


Unclassified 


A 1423 


7.4 




Unclassified 


-0.91+60.6 


Unclassified 


ZwCl 1231.4+1007 


4.4 


-1.3+0.5 


Boxy 


-0.56+0.19 


Boxy 


A 1682 


6.2 


0.0+0.5 


Elliptical 


-0.43+0.27 


Boxy 


A 1763 


8.2 


0.0+0.6 


ElHptical 


+0.04+0.08 


Elliptical 


A 1835 


6.8 




Unclassified 


+1.10+0.60 


Disky 


A 1914 


12.2 


0.0+0.5 


Elliptical 


+0.25+0.15 


Disky 


MS 1455.0+2232 


5.0 




Unclassified 


-0.47+0.25 


Boxy 


A 2009 


9.3 




Unclassified 


+0.14+0.19 


Elliptical 


A 2 146 


6.3 




Unclassified 


-0.20+0.50 


Elliptical 


A 2204 


7.9 




Unclassified 


+0.08+1.20 


Unclassified 


A2218 


8.1 


+2.3+0.8 


Disky 


+0.83+0.29 


Disky 


A2219 


8.5 




Unclassified 


+0.83+0.19 


Disky 


A 2254 


9.6 




Unclassified 


-0.03+0.36 


Elliptical 


RXJ 1720.1-2638 


5.5 




Unclassified 


+0.28+1.39 


Unclassified 


A 2261 


7.2 




Unclassified 


+0.39+7.20 


Unclassified 


A 2345 


7.8 


0.0+0.5 


Elliptical 


+0.03+0.09 


Elliptical 


RXJ2129.6+0005 


9.6 




Unclassified 


-0.16+0.60 


Elliptical 


A 2390 


3.5 


+3.5+0.5 


Disky 


+1.42+0.32 


Disky 


A 2485 


3.9 


-0.5+0.6 


Elliptical 


+0.19+0.63 


Elliptical 


A 2537 


5.2 


0.0+0.3 


Elliptical 


-0.11+0.18 


Elliptical 


A 2631 


6.0 


0.0+0.5 


ElHptical 


-0.24+0.12 


Boxy 



3.5 Comparing the Luminosity Gap witli Cluster 
Concentration 

To investigate the possibility that high-Ami2 clusters formed at 
earlier times than Iow-Ami2 clusters, we explore the relationship 
between Ami2 and the shape of the cluster dark matter halos via 
the concentration parameter. In Fig. [8] we plot A7ni2 versus con- 
centration, csoo for th e 41 clusters with available Chandra data 
I Sanderson et alj <2009l) ( i|2.4| l. The csoo — Ami2 distribution is 
similar to the (B4) — Ami2 distribution in that the lower-right of 
both plots is empty, and that clusters with A?ni2 < 1 span the full 
dynamic range in the vertical axis. To quantify this we again per- 
form a two sample KS test, on the Arni2 < 1 and Ami2 > 1 



sub-samples. In this case the low- and high-Ami2 samples con- 
tain 24 and 17 clusters respectively, with a maximum difference 
between their cumulative C500 -distributions of D = 0.2574. Ac- 
ceptance/rejection of the null hypothesis that low- and high- A»ni2 
clusters are drawn from same underlying C500 -distribution there- 
fore have roughly equal probability. However if we modify the def- 
inition of the low- and high-A77ii2 sub-samples by splitting the full 
sample at A7ni2 = 1.5 we are able to reject the null hypothesis at 
~ 1.7cr. We therefore conclude that the C500 — A?ni2 plane qual- 
itatively supports the interpretation of the (54) — A7TI12 plane, 
however statistically this is not decisive. Specifically, clusters with 
a large luminosity gap tend to have a relatively large concentra- 
tion parameter, although there is a curious deficit of clusters with 
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Figure 7. Luminosity gap statistic (Ami2) versus error-weiglited mean 
fourtli Fourier component of the BCG light distribution ((B4)). Positive 
values of (B4) correspond to Disky BCGs; negative values correspond 
to Boxy BCGs; values consistent with zero are consistent with elliptical 
isophotes. Clusters with Ami2 < 1 host BCGs with both Boxy and Disky 
isophotes. In contrast clusters with Ami2 > 1 host only non-Boxy (i.e. 
Elliptical or Disky BCGs). ~ 




Am, 2 

Figure 8. Concentration C500 versus luminosity gap f or 41 clusters fo r 
which the X-ray-based mass profiles are available from lSanderson et alf s 
12009.) analysis of archival Chandra data. 



Figure 9. Luminosity gap versus substructure fra ction measured wit hin 
R < 250/i~^kpc in 10 clusters from our sample bv lSmith et alj j200a) . 



3.6 Comparing the Luminosity Gap with Cluster 
Substructure 

Measurements of the substructure fraction (/sub), ie. the fraction 
of the total cluster mass tha t resides in substr uctures, are avail- 
able for t en of the clusters dSmith et al.l uOOSh from our sample 
of 59. Sm ith et al.t s gravitational lens models include mass com- 
ponents that account explicitly for substructures required to repro- 
duce the observed positions of multiply-imaged background galax- 
ies - these substructures comprise both galaxy group and indi- 
vidual galaxy masses. We plot /sub versus Am,i2 for these ten 
clusters in Fig. |9l revealing a relationship between these quan- 
tities in the sense that clusters with simpler gravitational poten- 
tials (low /sub) have more dominant BCGs (high A7ni2), and vice 
versa. To quantify this relationship, we fit a simple model to the 
data: log /sub = M + J^Ami2, and obtain best-fit parameters of: 
^J. = -0.29 ± 0.15 an d;/ = -0.58 ± 0.11. This result is consis- 
tent with that found bv lRichard et alj OOlObl) , despite the smaller 
aperture of 250kpc used in their study. This consistency arises be- 
cause the typical projected separation of the first and second ranked 
galaxies in our sample is <; 250kpc. We also double-check that the 
Ami2 distribution of the 10 clusters in Fig. |9]is consistent with 
that of the full sample, finding a maximum difference between the 
cumulative A7ni2 distributions of D = 0.2414, indicating roughly 
equal probability of rejection/acceptance of the hypothesis that the 
two samples are drawn from different underlying populations. 



Ami2 > 1.8 and C500 > 1.5. Clusters with lower luminosity gaps 
plausibly comprise both clusters that formed more recently than 
clusters with large gaps - and thus have lower concentration pa- 
rameters - and clusters that used to have a large luminosity gap, 
and thus formed early, and have a higher concentration parameter, 
but that then suffered infall of bright (i > L*) galaxies. Put an- 
other way, the existence of clusters in the top left corner of Fig.[8]is 
consistent with the timescale on which the concentration parameter 
of a cluster may be reset following a cluster-cluster merger being 
long compared with the infall timescale of ~ IGyr. 



3.7 Summary 

We now summarize the comparison of our luminosity gap mea- 
surements with other probes of the structure, and thus the age and 
assembly history of clusters, and discuss the interpretation of these 
results. 

The clearest empirical relationship found is between the 
Ami2 and /sub in the sense that clusters with a dominant BCG 
(Ami2 > 1) have a lower substructure fraction (/sub < 0.1) and 
vice versa. The strong correlation between Ami2 and /sub is in 
stark contrast with the triangular distributions of clusters in the 
Ami2-a, Ami2-(B4), and Ami2-C5oo planes. A simple physi- 
cal interpretation of the A77ii2-/sub relation is that both quantities 
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are sensitive to the same thing. As galaxies and groups of galax- 
ies fall into clusters the light emitted by the galaxies will either 
cause Ami2 to decrease or stay the same, depending on how bright 
the infalling galaxies are. At the same time the total mass of these 
galaxies and the group-scale halos within which they may be em- 
bedded causes /sub to increase. Early studies discussed the idea that 
galaxy groups with Ami2 > 2 may have formed at earlier times 
than groups with Ami2 < 2. However, more recently, a variety of 
studies have shown that both Ami2 and /sub are correlated with 
both the formation epoch of the host dark mat ter halo, and the re- 
cent hierarchical assembly h istory of the halo lOariush et al.ll2007l . 
bOldlSmith & TavloJl2008h . Therefore both theoretical and obser- 
vational studies across a broad range of dark matter halo mass are 
converging on the view that "fossil" status is not an end-point in the 
evolution of galaxy systems that formed early. Rather it is a phase 
that a galaxy system can evolve through if it formed early and then 
suffered minimal hierarchical infall after the formation of a bright 
massive central galaxy. The triangular distribution of clusters in the 
Ami2-a, Ami2-(S4), and Ami2 — C500 planes are all consistent 
with this interpretation, and inconsistent with the idea that fossil 
galaxy systems are evolutionary cul-de-sacs. Specifically, if a clus- 
ter forms early and then sufficient time elapses for a large lumi- 
nosity gap to form via merging of L* gas-rich galaxies to form the 
BCG, and for a cool core to form, then this cluster will reside in 
the top-right comer of Figs. 7 & 8 and the bottom right of Fig. 5. 
If a ^ L* galaxy then falls into the cluster, either on its own or in 
a group, then the cluster would move left-ward in all of Figs 5, 7, 
and 8 as soon as the infalling galaxy system crosses the aperture 
within which Ami2 is measured (in our case 0.4r2oo)- Several Gyr 
later the infalling structure will reach the center of the cluster, and 
its merger with the cluster may be sufficiently energetic to modify 
the strength of the cluster cool core, the shape of the BCG, and the 
concentration of the cluster dark matter halo. In this way, clusters 
can move vertically in Figs. 5, 7, and 8, and produce the observed 
triangular distribution of clusters. 

The interpretation of non-boxy morphologies ((-B4) >0) of 
BCGs in clusters with large l uminosity gaps is an impor tant ele- 
ment of the discussion above. iKhochfar & BurkertI 12003) showed 
that the morphology of early-type galaxies is sensitive to the mor- 
phology (indicative of gas content) of their progenitors and subse- 
quent gas infall. The straightforward interpretation of the observ- 
ables is therefore that dominant BCGs formed from mergers of gas 
rich (presumably spiral) galaxies and/or have accreted gas since the 
last major merger in their assembly history. Formation of dominant 
BCGs from gas rich progenitors is consistent with the early forma- 
tion of these BCGs as discussed above, because at earlier times the 
galaxies from which BCGs formed would have been more gas rich 
than at later times. 

To disentangle the relative contribution of gas rich mergers 
and accretion of gas to the disky shape of some BCGs we plot 
in Fig. [To] a, the slope of the logarithmic gas density profile at 
0.04r5oo versus (-B4). If BCG morphology is strongly influenced 
by gas cooling onto the BCG then one would expect a relationship 
between a and (Bi) in the sense that disky BCGs ({-B4) > 1) 
would live in clusters with a steep central (a < —0.5) gas den- 
sity profile. This is because clusters with steep central gas density 
profiles host a cool core - i.e. a central positive temperature gra- 
dient, absence of an entropy core, and a cooling timescale short 
compared with the age of the universe. However we do not find any 
strong relationship between {B4) and a in Fig. (TO) We divide the 
41 clusters with Chandra data into those with the strongest cool 
cores - Q < —0.9 - and the rest. A two sample KS test on these 
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Figure 10. Strength of the cool core in each cluster, as measured by 
a the slope of the lo garithmic gas density profile at 0.04r5oo from 
ISanderson et alj 120091) versus the (-B4). The absence of a relationship 
between a and (-B4) suggests that disky BCG isophotes are more likely 
caused by such BCGs being formed from mergers between gas rich galax- 
ies than by cooling of gas onto the BCG. The typical error bar on a is 
<0.1. 



two sub-samples yields a maximum difference between the cumu- 
lative {_B4)-distributions of _D = 0.209, indicating roughly equal 
probability of accepting/rejecting the hypothesis that the two dis- 
tributions are drawn from the same underlying distributions. 

In the absence of a strong relationship between cool core 
strength and BCG morphology, we therefore conclude that BCG 
morphology is more sensitive to the gas content of the galaxies that 
merged to form it, than to the subsequent gas accretion history of 
the BCG. This view is consistent with the comparison of a, Ami2 
and BCG activity in Fig. [5] and the discussion of the dependence 
of Mk,2 on Am,i2 in i]3.2l The key point being that the merging 
of luminous cluster galaxies to form the BCG appears to have a 
much stronger influence on luminosity gap than gas cooling and 
subsequent star formation within BCGs. 



4 COMPARISON WITH THEORETICAL PREDICTIONS 

Modern galaxy formation and evolution models contain physical 
prescriptions for many physical processes relevant to the formation 
and evolution of galaxies, including dynamical friction, conversion 
of cold gas into stars during galaxy mergers, and AGN feedback. 
These processes are particularly important in the centres of galaxy 
clusters where they regulate the cooling of gas onto the most mas- 
sive galaxies in the universe - BCGs. However the models were not 
constrained by the luminosity gap distribution; our observational 

results can therefore provide a strong test of t he models. 

We compar e our obs ervations with the iBower et al.l (|200g), 

■analyti( 
models, all of which are based on the Millennium Simulation^^l - 



)mpare o ^ 

ICroton et J] J2OO6.) , and .de Lucia & Blaizol ( 120071) semi-anaMic 
models, all of which are based on the Millennium Simulation^^l - 
a cosmological numerical simulation of dark matter in a volume 



^^ The Millennium Simulation used in this paper was carried out by the 
Virgo Supercomputing Consortium at the Computing Centre of the Max- 
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Figure 12. Absolute K-hand magnitude of the first (black circles) and second ranked (red triangles) galaxies as a function of luminosity gap from the three 
semi-analytic galaxy formation models discussed in [|4] The solid black and red lines show the best straight-line fit to the observatio nal data shown in Fig.|4] 
The horizontal dashed line in each panel is at Mk = —24.34, the absolute magnitude of an L* galaxy, taken from lLin et alj J2004h . 
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Figure 11. Distribution of the observed luminosity gap (black points - 
see also Fig. [3) compai'ed with the same for clusters with -A/viriai > 
5 X 10^* M0, measured within a projected BCG-centric radius of 640kpc 
using the the Millennium simulation-based semi-a nalyt ic galaxy formation 
model s of lSoweretaP J2006l) . ICroton et alj J200d) . and lde Lucia & Blaizoll 
120071) . The error bar on each bin in the theoretical histograms is compara- 
ble with the observational eiTors. 



spanning 500/i~^Mpc containing ~ 10 ^" particles. An important 
difference between the models is that the lBower et alj model im ple- 
ments "quasar" mod e AGN feedback, whereas the lCroton et al.l and 
Ide Lucia & Blaizoj mo dels implement "rad io" mode AGN feed- 
back. We also note that lde Lucia & BlaizoJ compared their model 
predictions with the observed properties of BCGs, however they 
didn't compare with observed luminosity gaps. 

First we select dark matter halos from the Millennium dark 
matter friends of friends catalogue. Within the whole simulated vol- 
ume, 209 hales were found with masses greater than 5 x 10^* M©, 
i.e. above the mass threshold of the observed sample. We then ex- 
tracted galaxies in these 209 halos from the semi-analytic galaxy 
catalogues based on each of the three models. The A'-band lumi- 
nosity gap was computed for each halo within a projected cluster- 



Planck Society in Garching. The semi-analytic galaxy catalogue is publicly 
available at,|http://www.mpa-garching.mpg.de/galfonn/agnpaper| 



centric radius of 640kpc. The predicted luminosity gap statistic dis- 
tributions are over-plotted on the observed distribution in Figjlli 

The observed Ami2 distribution is consistent, within the 
uncertaintie s with a mo notonically declining function of Ami2 
(ED- The iBower et alj model matches this observational result 
well, and the predicted fraction of clusters with the most extreme 
luminosity gaps is p(Ami2 > 2) = 0.02^q(5j^, just ~ 1.3(t be- 
low the observedjraction of p(Ami2 > 2) ^O-Ort^Q-'^ol (EU. In 
contrast, the lCroton et al.l model peaks at Am,i2 ~ 1 — 1.5 - i.e. it 
does not predict a monotonic decline of p(Am,i2) - however it pre- 
dicts p( Am 12 > 2) = 8. 6^9 n% which is in excellent agreement 
with the observations. The lde Lucia & BlaizotI model predicts a yet 
more prominent peak at ATni2 ~ 1 — 1.5, and a yet higher fraction 
of clusters with extreme Ami2,P(Ami2 > 2) = 16.7l3;g%, that 
disagrees with the observations at ~ 2a. 

Following the same approach as in i]3.2l we also decompose 
the predicted Ami2 distributions into the predicted absolute mag- 
nitudes of the first (Mx,i) and second (Mk,2) ranked galaxies 
(Fig.ll2t. The most striking feature of this figure is that the slopes 
of AIk,i versus A7ni2 and Mk,2 versus A7ni2 are much steeper 
and shallower than the observations res pectively in the Bower et al.l 
model. In contrast, the lCroton et al.l and lde Lucia & Blaizoll models 
succeed much better in reproducing the observed trends. Interest- 
ingly, the discrepant tre nds in Mk,i — A7ni2 and A'Ik,2 — Ami2 
within the lBower et al.l model conspire to produce a distribution of 
Ami2 in Fig.|3]that is in good agreement with observations. 

The ab solute magnitudes of BCGs span ~ 2niag in the 
iBower et al.l model, in contrast to the observed range of ~ Imag. 
As BCGs grow, the largest increase in luminosity from purely in- 
gesting another galaxy is a brightening by 0.75mag, i.e. a merger 
between the brightest two galaxies in a cluste r with Ami2 = 0. 
The very large spread in Mk for BCGs in the Bower et alj model 
therefore indicates that the conversion of cold gas into stars is too 
efficient in their model. In the model, most of the mergers that 
form BCGs are between gas poor galaxies. The main source of 
gas for formation of new stars is that which cools from the intr- 
acluster medium. The steep relationship between Mk,i and Ami2 
therefore im plies that AGN feedback in BCGs is too weak in the 
iBower et al.l model. An important caveat on this interpretation is 
that we showed in ij i|3.4| & [T7] that clusters with large luminosity 
gaps (Ami2 > 1) have non-boxy isophotes and therefore likely 
formed from mergers of gas rich galaxies, i.e. probably at higher 
redshift than the BCGs in the model. 

The sh allow slope o f the relationship between Mk,2 and 
A7ni2 in the lBower et al.l model implies that the replenishment of 
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the supply of cluster galaxies that are ingested into their respec- 
tive BCGs is too efficient in this model. Specifically, the difference 
in slopes of AIk,2 — Ami2 between iBower et alj and the other 
two models could arise from differing treatments of the merging 
of galaxies in the respective models following the time at which 
individual galaxy halos lose their identity following ingestion into 
the parent cluster halo. W e also comment, more generally, that the 
galaxies in lBower et aLr s model tend to be less luminous than the 
observed galaxies by ~ 0.5 mag, and those in Croton et al. 's model 
tend to be over-luminous by ~ 0.3 mag. This suggests that the 
strength of feedback in the general cluster population may be too 
strong in the former and too weak in the latter model. 

For completeness, we also compare our measurement of the 
fraction of lO^^Ma clusters th at satisf y Ami2 > 2 with predic- 
tions from lMilosavlievic et ali s ( 120060 anal ytic model. Our mea - 
surement of O.OT^oog is well within 2a of Milosavlievic et ahl 's 
prediction of 0.03. The most obvious difference between their 
model and our observations is that the prediction is calculated 
within the cluster virial radii, in contrast to our calculation within 
a projected cluster-centric radi us of '^ 0.4r2oo. Th e larger volume 
within each cluster probed by iMilosavlievic et al.l will reduce the 
probability of finding clusters with large luminosity gap statistics. 
The same authors also estimate the fraction of 10^^ M© clusters 
with Ami2 > 2 using data from the Sloan Digital Sky Survey 
JMiller et alj|2005n . obtaining a similar fraction to their prediction. 
The possible disagreement between this estimate and our own is 
harder to understand because both use a similar physical aperture 
for the calculation of A7ni2. We note, however, that the two ob- 
served cluster samples are selected in different ways; our sample is 
X-ray selected whilst SDSS is optically selected. 



5 CONCLUSIONS 

We have combined wide-field near-infrared imaging from the 
WIRC camera on the Hale 200in telescope, with HST, Chandra, 
and Spitzer observations of 59 massive galaxy clusters at 2; ~ 0.2 
to explore the connections between the formation histories of BCGs 
and the galaxy clusters that they inhabit. This large statistical sam- 
ple is intended to be representative of the underlying population of 
massive X-ray luminous clusters. Extensive tests confirm that re- 
sults based on this sample can be regarded as statistically compat- 
ible with those from a complete volume-limited sample. Our main 
empirical results are as follows: 

(i) We have made the first observational measurement of the dis- 
tribution of the luminosity gap statistic, A7TI12, of massive ~ 
10^^ M0 clusters. The probability distribution of the luminos- 
ity gap statistic is a monotonically declining function of Ami2, 
well described by the relation p(Am,i2) = A + BAmi2 with 
A = 0.41 ± 0.03 and B = -0.13 ± 0.02. 

(ii) Following lOariush et al.l ( 120070 we used Monte Carlo simula- 
tions to quantify the fraction of clusters with large luminosity gaps 
expected from random sampling of a Schechter function. The ob- 
served distribution exceeds the statistical distribution derived from 
the Monte Carlo simulation at Ami2 > 1 at ~ 3(t significance, 
confirming that the most extreme luminosity gaps have a physical 
origin, and are not statistical flukes. 

(iii) Four of our sample of 59 clusters have extreme luminosity 
gaps of Ami2 > 2 - ZwCl 1309.1+2216, A 1835, A2261, and 
RXCJ2 102. 1-2431 - which equates to a fraction of 10^^ Mq 
clusters that have Ami2 > 2 of p(Ami2 > 2) = 0.07^ 



F0.05 
-0.03- 



(iv) The morphology of 45/59 BCGs was measured by analyzing 
the shape of the BCG isophotes in archival and new HST obser- 
vations of the cluster cores. The split between boxy, elliptical and 
disky isophotes is: 22% boxy, 32% elliptical, 29% disky, with 17% 
unclassified. 

(v) A strong correlation is found between Ami2 and /sub, the 
fraction of mass in the cluster cores associated with group- and 
galaxy scale dark matter halos, the latter com ing from publishe d 
gravitational lens models of the cluster cores dSmith et al.ll2005h . 
The relationship between Ami2 and /sub is parameterized thus: 
log /sub ~ /i + !^A7Tii2, with best fit parameters /i = — 0.29±0.15 
ad!/ = -0.58 ±0.11. 

(vi) Clusters with large luminosity gaps, A7ni2 ^ 1 — 1-5, have 
cuspy gas density profiles, and thus relatively strong strong cool 
cores (a < —0.6, where a is the logarithmic gas density profiles 
at 0.04r5oo), elliptical or disky BCGs ({S4) > 0, where B4, is the 
fourth-order Fourier co-efficient of the optical isophotes), concen- 
trate d dark matter de nsity profiles (C500 > 1, where C500 is based 
on a lHemquistll 1990l model fit to the Chandra data), and small sub- 
structure fraction (/sub <, 0.1, where /sub is based on strong lens 
modeling of the mass distribution). 

(vii) In contrast, clusters with small luminosity gaps, A7ni2 < 1, 
span the full range of observed cool core strengths (—1.3 < a < 
0), span the full range of boxy, elliptical, and disky BCG morpholo- 
gies (—0.015 < (-B4) < —0.015), span the full range of concen- 
trations (csoo ~ — 2.5), and have large substructure fractions 
(/sub ^ 0.1). 

Clusters with Ami2 Jj 1 are therefore a more homogeneous 
population than clusters with Ami2 < 1. The stronger cool cores, 
more concentrated mass distribution, and non-boxy BCGs, all point 
towards high-A7Tii2 clusters forming at early times. Such early for- 
mation is required to allow sufficient time to pass for the BCG to 
ingest (aided by dynamical friction) the bright cluster galaxy pop- 
ulation in order to develop the large luminosity gap, and for the es- 
tablishment of the cool core. The formation of more concentrated 
dark matter halos at earlier times than less-conc entrated halos is a 
generic prediction of cold dark matter theory (e.g. lNeto et al.l2007h . 
The interpretation of disky BCGs is less straightforward, however 
such morphologies can plausibly be interpreted as evidence for the 
last major mergers in a BCG's formation history comprising gas- 
rich galaxies - the presence of gas thus leading to the establishment 
of a disk-like structure in the BCG. This gas-rich merger scenario 
for BCG formation is consistent with the early formation of large- 
Ami2 clusters. 

How can the heterogeneous population of low- Ami2 clusters, 
and more specifically, the fact that some low-A77ii2 clusters have 
strong cool cores, non-boxy BCGs, and high concentrations, be in- 
terpreted within the context of the early formation of high-A»ni2 
clusters? The most natural explanation is that Iarge-Ami2 clus- 
ters can evolve into low-Ami2 clusters when the supply of bright 
cluster galaxies is replenished by episodes of hierarchical infall of 
smaller galaxy systems, such as galaxy groups. Such infall would 
depress A7TI12 and increase /sub immediately that the group en- 
tered the measurement aperture (in this case a clustercentric ra- 
dius of ~ 0.4r2oo), and would modify other cluster properties such 
the cool core strength, BCG morphology, and concentration of the 
mass distribution on longer timescales of several Gyr. The observed 
heterogeneity of low-Ami2 clusters can therefore be explained by 
these clusters comprising both (i) clusters that have formed more 
recently, and thus have a low concentration, haven't had time to de- 
velop a large luminosity gap and cool core, and have a BCG formed 
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from relatively gas-poor mergers, and (ii) clusters that formerly had 
a large luminosity gap, and have suffered hierarchical infall in the 
previous few Gyr. We therefore conclude that a large luminosity 
gap (and large substructure fraction) is a phase through which a 
cluster can evolve if sufficient time elapses between episodes of hi- 
erarchical merging of other galaxies and groups of galaxies with 
the cluster. The large s catter seen in the theoretical age-Ami2 and 
age-/ sub relationships JDariush et alJIlOOTl I2OI0I : ISmith & Tavlon 
l2008i) lend further weight to the view that both the age and the re- 
cent merger history of a cluster contribute to the observed values of 
Ami2 and /sub. 

We also compare our observational results with predictions 
from Millennium simulation-based semi-analytic models of galaxy 
evolution. We find that none of the mo dels can successfull y re- 
produce the observations in their entirety. iBower et al.l ( 120060 suc- 
ceeds best at reproducing the monotonically declining p(Ami2), 
however they predict a relationship between BC G luminosity 
and A7ni2 th at is far too steep. In contrast, both ICroton et al.l 
( 12006.) and .de Lucia & Blaizoj ( 120071) predict that p(Ami2) peaks 
at A rwi2 ^ 1 — 1.5, in disagreement with the observations, 
with de Lucia & Blaizot predicting the more prominent peak, 
[de Lucia & Blaizot also predict p(Ami2 > 2) ~ 0.17, in con- 
trast to_the_obseryed value of p(ATni2 = 0.071qq3. Nevertheless, 
both|Crgtonet_al.'andlde Lucia & Blaizoll match the observed slope 
of the relationship between BCG luminosity and Ami2 very well. 
We discuss the possible causes of these disagreements, and suggest 
that lBower et al.' s model may be too efficient at converting cold gas 
to stars in BCGs, and may also to be too efficient at replenishing 
the supply of galaxies in clusters. 

We also note that semi-analytic galaxy evolution models also 
fail to reproduce o bservational results on high redshift BCGs 
dCoUins et ai]|2009l : iStott et al. 2010 ). Our new results add to this 
picture of the inability of models to reproduce observations of 
BCGs. An important strength of our results is that we do not rely on 
calculations of the stellar mass of BCGs, and thus are insensitive to 
possible systematic uncertainties in stellar mass estimates for ob- 
served BCGs arising from alternative stellar population models. 

Our future work on the hierarchical assembly of clusters at 
z ~ 0.2 will take advantage of the wide-field multi-wavelength 
dataset that we are assembling, including mid/far-IR observations 
with Spitzer and Herschel, joint strong/weak-lens modeling of the 
cluster mass distributions, our spectroscopic redshift survey of clus- 
ter galaxies with MMT/Hectospec, and X-ray observations with 
XMM-Newton and Chandra. 
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